Blood-biomarkers and devices for atrial fibrillation screening: Lessons learned from the AFRICAT (Atrial Fibrillation Research In CATalonia) study

Background and objective AFRICAT is a prospective cohort study intending to develop an atrial fibrillation (AF) screening program through the combination of blood markers, rhythm detection devices, and long-term monitoring in our community. In particular, we aimed to validate the use of NT-proBNP, and identify new blood biomarkers associated with AF. Also, we aimed to compare AF detection using various wearables and long-term Holter monitoring. Methods 359 subjects aged 65–75 years with hypertension and diabetes were included in two phases: Phase I (n = 100) and Phase II (n = 259). AF diagnosis was performed by baseline 12-lead ECG, 4 weeks of Holter monitoring (NuuboTM), and/or medical history. An aptamer array including 1310 proteins was measured in the blood of 26 patients. Candidates were selected according to p-value, logFC and biological function to be tested in verification and validation phases. Several screening devices were tested and compared: AliveCor, Watch BP, MyDiagnostick and Fibricheck. Results AF was present in 34 subjects (9.47%). The aptamer array revealed 41 proteins with differential expression in AF individuals. TIMP-2 and ST-2 were the most promising candidates in the verification analysis, but none of them was further validated. NT-proBNP (log-transformed) (OR = 1.934; p<0.001) was the only independent biomarker to detect AF in the whole cohort. Compared to an ECG, WatchBP had the highest sensitivity (84.6%) and AUC (0.895 [0.780–1]), while MyDiagnostick showed the highest specificity (97.10%). Conclusion The inclusion and monitoring of a cohort of primary care patients for AF detection, together with the testing of biomarkers and screening devices provided useful lessons about AF screening in our community. An AF screening strategy using rhythm detection devices and short monitoring periods among high-risk patients with high NT-proBNP levels could be feasible.


Introduction
Atrial fibrillation (AF) is one of the most common cardiac arrhythmias in the general population, and its incidence and prevalence are increasing globally, making it an important public health problem [1]. Treatments exist to prevent the most important complications of AF, such as oral anticoagulants (OACs) for stroke risk reduction [2]. However, the condition is frequently asymptomatic and paroxysmal, which makes it difficult to diagnose. As a consequence of the failure to detect AF at early stages, 10% of strokes are caused by previously unknown AF [3]. Screening strategies could potentially increase AF detection rates and reduce its complications, but the best screening approach is not clear [4].
In recent years, a variety of devices and new methodologies for AF detection have appeared, ranging from single time-point to long-term monitoring devices capable of identifying brief asymptomatic AF episodes [5]. Although intensive monitoring periods increase the AF diagnostic yield [6], screening strategies in the primary care setting require inexpensive and cost-effective methodologies.
The incorporation of blood biomarkers into AF screening strategies is gaining interest. It represents an opportunity to enlarge the window for AF detection as some of them might present a kind of "biological memory" being elevated even outside AF episodes. A promising biomarker in this context is N-terminal pro B-type natriuretic peptide (NT-proBNP), which we recently found elevated in AF individuals, even in paroxysmal cases [7]. However, the usefulness of this biomarker needed to be confirmed and potentially complemented by others.
In the present work, we aimed to explore several tools to be applied in an AF screening program in the primary care health setting. In particular, we aimed to validate the use of NT-proBNP, and identify new blood biomarkers associated with AF. Also, we aimed to compare AF detection using various wearables and long-term Holter monitoring.

Study population
AFRICAT (Atrial Fibrillation Research in CATalonia; NCT03188484) is a prospective, multicenter, population-based screening study for AF. The study was divided into two phases: Phase I (2016-2017) for discovery and verification, and Phase II (2019-2020) for validation (Fig 1). Individuals aged 65-75 with a registered diagnosis of hypertension and diabetes were identified from the clinical records of three different Catalonia health areas (SAP Muntanya, SAP Reus, and SAP Terres de l'Ebre) and invited to participate in the study. Individuals with chronic inflammatory diseases, active cancer, or dementia were excluded. In Phase II, patients with a previous diagnosis of AF were also excluded. At a baseline visit, patients received a comprehensive assessment consisting of clinical characteristics (demographic factors, vascular risk factors, medications, comorbidities, and vitals) and electrocardiography assessment. At the same baseline visit, several AF detection devices were tested (Watch BP, MyDiagnostick, and AliveCor or Fibricheck), and blood was collected. Moreover, patients received a wearable Holter device (Nuubo™) and were instructed by locally trained researchers to wear it for 4 weeks as described previously [7]. AF diagnosis was performed by a baseline 12-lead electrocardiogram (ECG), 4 weeks of monitoring with a wearable Holter device (Nuubo™), and/or medical history. On Holter monitoring, AF was defined as irregular R-R intervals without a P wave signal, lasting for more than 60 s. Expert cardiologists blinded to clinical characteristics evaluated the anonymized Holter records and ECGs to identify AF episodes. Holter monitoring was optional in the patients with a previous diagnosis of AF included in Phase I. More information about the study protocol has already been published [7].
The AFRICAT study protocol was approved by the clinical research ethics committees of IDIAP Jordi Gol (P15/047) and Hospital Universitari Vall d'Hebron [PR (AG) 133-2015]. All participants signed a written informed consent before inclusion. The study protocol conformed to the ethical guidelines of the 1975 Declaration of Helsinki.

Biomarker quantification
Blood was collected into EDTA and serum tubes at the time of inclusion. After centrifugation at 1500 g and 4˚C for 15 min, plasma and serum aliquots were frozen at −80˚C until biomarker determination.

Discovery study-Somascan.
To identify proteins differentially expressed in AF individuals, an experimental design including discovery, verification, and validation phases was followed. For the discovery study, protein levels in plasma were assessed using the SOMAscan platform (SomaLogic Inc., Boulder, CO, USA), an aptamer-based proteomic assay measuring 1305 proteins simultaneously [8]. Normalization and calibration procedures were performed by SomaLogic according to their protocol and final data were reported in relative fluorescent units (RFU) [9]. The most promising candidates from this experiment (according to the best p-value, logFC, and plausible pathophysiological role) were verified in all Phase I subjects, and subsequently those with a nominal p-value <0.1 were selected for validation in Phase II.
All assays were performed blinded to clinical information and according to the manufacturer's instructions. All samples were tested in duplicate and inter-assay variation was determined by a commercial control (Human Serum, male AB, USA origin from clotted, SIGMA, ref number H16914; Human plasma K2 EDTA, Innovative Research, ref number IPLA-N) tested in duplicate in each plate. Samples whose duplicate values showed a coefficient of variation (CV) higher than 20% were removed. When inter-assay variation was >20%, biomarker levels were standardized. In each separate phase, samples were randomized across the plates according to AF diagnosis so that each plate had the same percentage of cases. Standardization was therefore performed, when needed, by dividing all sample concentrations by a coefficient calculated as the ratio of the plate-specific median to the overall median across plates. The commercial control sample was used as a reference to standardize the results for the joint analysis of Phase I and Phase II.
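The duplicate filter and the plate-median standardization described above can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the "overall median" is taken here as the median of all samples pooled across plates (the text could also mean the median of the plate medians), and the CV is computed from the sample standard deviation of the two replicates.

```python
from statistics import mean, median, stdev

def cv_percent(rep1, rep2):
    """Coefficient of variation (%) of a duplicate measurement (hypothetical helper)."""
    values = [rep1, rep2]
    return 100.0 * stdev(values) / mean(values)

def standardize_by_plate(plates):
    """Divide each concentration by (plate median / overall median).

    plates: dict mapping a plate id to a list of sample concentrations.
    Assumption: the overall median is computed on all samples pooled together.
    """
    pooled = [v for values in plates.values() for v in values]
    overall_median = median(pooled)
    standardized = {}
    for plate_id, values in plates.items():
        coefficient = median(values) / overall_median
        standardized[plate_id] = [v / coefficient for v in values]
    return standardized

# Toy example: plate B reads systematically twice as high as plate A.
plates = {"A": [10.0, 20.0, 30.0], "B": [20.0, 40.0, 60.0]}
result = standardize_by_plate(plates)
```

After standardization, both toy plates share the same median, which is the intended effect of the correction. Because cases were distributed evenly across plates, a plate-level shift of this kind is assumed to be technical rather than biological.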

Devices
At the baseline visit, AF detection devices were tested. In Phase I, AliveCor, Watch BP, and MyDiagnostick were used. Due to a high rate of unclassified cases (13.3%), AliveCor was replaced by Fibricheck in Phase II (Fig 2).
AliveCor KardiaMobile (AliveCor, USA) [10] is a smartphone-based device that consists of a single-lead electrocardiographic recorder connected to a mobile phone application.
MyDiagnostick (Applied Biomedical Systems, Netherlands) [11] is a single-lead ECG recorder with the shape of a stick with metallic handles (electrodes) at both ends. After a minute of holding the device with both hands, an indicator (green/red) of possible irregular pulse appears. It operates without additional hardware and can record and store over 100 ECGs.
Microlife WatchBP Home monitor (MicroLife, USA) [12] is an oscillometric digital blood pressure monitor device, which has an algorithm for AF detection based on the regularity of pulses. It automatically takes three sequential measurements to detect possible AF.
Fibricheck (Fibricheck, Belgium) [13] is a mobile phone app based on photoplethysmography (PPG) technology. It uses the flashlight and the camera of the mobile phone to measure the changes in blood flow and calculate the heart rate.
For each device, binary information about rhythm (rhythmic vs. arrhythmic) was recorded. Specificity, sensitivity, positive predictive value (PPV), negative predictive value (NPV), and area under the ROC curve (AUC) were calculated for each device in comparison to ECG results. For the analysis, unclassified device records were considered as No AF.
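As an illustration of these performance measures, the sketch below computes them from paired device and ECG readings, counting unclassified device records as No AF as stated above. The function name and the label strings are hypothetical. The AUC line relies on the fact that a single binary test has one operating point on the ROC curve, so the area reduces to (sensitivity + specificity)/2; whether the authors computed the AUC exactly this way is an assumption.

```python
def diagnostic_metrics(device_results, ecg_af):
    """Compare device readings with the ECG reference.

    device_results: list of "AF", "No AF" or "unclassified" (hypothetical labels).
    ecg_af: list of bool, True when the 12-lead ECG shows AF.
    Unclassified device records are counted as No AF.
    """
    tp = fp = tn = fn = 0
    for reading, reference in zip(device_results, ecg_af):
        positive = reading == "AF"  # "unclassified" falls through to No AF
        if positive and reference:
            tp += 1
        elif positive and not reference:
            fp += 1
        elif not positive and reference:
            fn += 1
        else:
            tn += 1

    def ratio(num, den):
        return num / den if den else float("nan")

    sensitivity = ratio(tp, tp + fn)
    specificity = ratio(tn, tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        # Single operating point: area under the two-segment ROC curve.
        "auc": (sensitivity + specificity) / 2,
    }

# Toy data, not study data.
device = ["AF", "AF", "No AF", "unclassified", "No AF", "No AF"]
ecg = [True, False, True, True, False, False]
metrics = diagnostic_metrics(device, ecg)
```

Note that assigning unclassified records to No AF can only lower the measured sensitivity, because an unreadable trace from a patient with AF is counted as a missed case.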

Statistical analysis
Statistical analyses were conducted with SPSS version 20 and R software version 3.6.3. Data were expressed as number (%) for categorical variables and as mean ± SD or median (interquartile range) for continuous variables, depending on their distribution. For univariate analysis, the Mann-Whitney U-test or Student's t-test was used for continuous variables, and the χ² test was used for categorical variables.
The number needed to screen was calculated dividing the number of patients included, by the number of new AF cases detected.
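As a worked example of this definition, the short sketch below applies it to the totals reported in the Results (359 patients included, 24 newly detected AF cases). The function name is hypothetical.

```python
def number_needed_to_screen(n_included, n_new_cases):
    """Patients screened per newly detected AF case."""
    if n_new_cases <= 0:
        raise ValueError("at least one new case is required")
    return n_included / n_new_cases

nns = number_needed_to_screen(359, 24)
print(round(nns, 2))  # 14.96
```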
For the discovery experiment, p-values were corrected using the false discovery rate (FDR), and base-2 logarithmic fold-changes (logFC) were calculated for each protein by subtracting the logarithmic abundance of the non-AF samples from that of the AF samples (logFC = log2 [AF mean/no AF mean]). The Spearman test was used for correlations. Comparisons were first performed between AF patients vs no AF patients, and second between Holter-detected AF vs no AF patients (excluding AF patients diagnosed with other methodologies). Biomarker levels were log-transformed and forward stepwise logistic regression was used to select the ones that independently predicted AF in the whole cohort. Binary logistic regression analyses were performed including the most promising biomarkers and clinical variables of interest (sex, age, heart failure, ischemic cardiopathy, and valvular disease).
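The two discovery statistics can be sketched as below. The fold-change follows the bracketed formula in the text (log of the ratio of group means). The text does not name the FDR procedure, so the Benjamini-Hochberg adjustment used here is an assumption, and both function names are hypothetical.

```python
import math

def log2_fold_change(af_values, no_af_values):
    """log2 of (mean abundance in AF / mean abundance in no AF)."""
    mean_af = sum(af_values) / len(af_values)
    mean_no_af = sum(no_af_values) / len(no_af_values)
    return math.log2(mean_af / mean_no_af)

def benjamini_hochberg(pvalues):
    """FDR-adjusted p-values (Benjamini-Hochberg; assumed, not stated in the text)."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * m / rank)
        adjusted[index] = running_min
    return adjusted

# Toy values in relative fluorescence units, not study data.
lfc = log2_fold_change([4.0, 4.0], [1.0, 1.0])
adj = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

A positive logFC means higher abundance in the AF group. With roughly 1300 proteins tested, a nominal p-value just below 0.05 is multiplied by a large factor under this adjustment, which is consistent with no protein surviving correction in a 26-sample experiment.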
The sample size needed for Phase I was calculated aiming to detect a minimum of 10 cases of AF to perform a biomarker case-control discovery experiment and further verification. According to unpublished data from the AFABE study [14], the prevalence of AF in patients with hypertension and diabetes in our community was 9.6%; therefore, a sample of 100 patients was included in Phase I. Then, for Phase II, sample size was estimated based on Phase I biomarker results (power of 80%, α = 0.05) (Ene 3.0, GlaxoSmithKline, UK). According to these calculations, the sample size for Phase II should be 60, 213, or 282, depending on the biomarker data used for the calculation (NT-proBNP, ST-2, or TIMP-2, respectively).

Patient characteristics and AF detection
100 patients were included in Phase I and 259 in Phase II (Fig 1). The number needed to screen to detect one new AF case in our study was 14.96. Clinical characteristics of the cohort can be found in Table 1.
In general, anticoagulation treatment was more frequent in the AF group (p<0.001). There were also some clinical differences between the two phases. In the no AF group, older age (p = 0.015), female sex (p<0.0001), heart failure (p = 0.004), and higher levels of systolic blood pressure (p = 0.004) were more common in Phase II than in Phase I, while smoking (p = 0.010) was more common in Phase I. In the AF group, anticoagulant treatment (p = 0.011) and coronary heart disease (p = 0.005) were more common in Phase I. Median monitoring time was 506 hours (IQR 267h-600h). Holter monitoring was not evaluable in 16 patients, and 44 had short monitoring periods. Therefore, patients with poor Holter records (<100 h registered and/or with artifacts) were excluded from the biomarker analysis to avoid misclassifications (Fig 1). All the patients had a baseline ECG and were therefore included in the device analysis.
AF was present in a total of 34 subjects (24 newly detected within the present study). AF cases were classified into 4 groups according to how the AF diagnosis was made, defined by past medical history (PMH) of AF, ECG findings, and Holter AF detection as follows: PMH-ECG-Holter+ (Holter-detected AF), PMH-ECG+Holter+, PMH+ECG+, and others. According to this classification, 17 individuals were only diagnosed during the Holter monitoring period (Holter-detected AF), 7 in Phase I and 10 in Phase II. Of these, 82.35% had AF episodes during the first 7 days, and 88.23% during the first two weeks (Fig 3). Seven individuals, 4 in Phase I and 3 in Phase II, had new-onset AF on baseline ECG. In all those patients, AF was also found in the Holter record. Seven individuals in Phase I had a previous medical history of AF, confirmed by ECG at the baseline visit, and during the Holter monitoring period in those who were followed up (monitoring in this group was optional). Finally, we classified as others those patients with an external diagnosis of AF, made before or during the study, that was not detected by the ECG or the Holter in the present study.

Biomarker analysis
3.2.1. Discovery. For the discovery study, a subset of 13 AF patients (of whom 7 had Holter-detected AF) and 13 no AF patients with long monitoring records (>450 h) were selected from the whole Phase I cohort.

PLOS ONE
In this subset, patients with AF tended to be younger (p = 0.087) and have higher rates of coronary heart disease (p = 0.073) than no AF patients. Besides, anticoagulation treatment was more common in the AF group (p = 0.039), and antiplatelet treatment was more common in the no AF group (p = 0.006). The other variables were similar between the two groups.
Of the 1305 proteins evaluated, 41 differed between the two groups (nominal p-value<0.05) (S1 Table): 19 had higher levels in patients with AF, and 22 in patients without AF. No protein remained significant after correction for multiple comparisons. Eleven candidates were chosen on the basis of the best p-value, logFC, and plausible pathophysiological role to be tested in the whole Phase I (Table 2). Other candidates, like NT-proBNP, were previously tested in this cohort [7] Table 2). The remaining biomarkers did not differ between the two groups. None of the biomarkers showed significant differences when comparing Holter-detected AF vs no AF.
The known effect of anticoagulation on factor IX levels [15,16] could not be eliminated from our data, as 40% of the AF patients were previously anticoagulated (mainly due to previous AF history) in comparison to only 1.3% of no AF patients. Therefore, patients on oral anticoagulants were removed from the analysis, and the difference was no longer significant (p = 0.747). For that reason, this protein was not selected to be tested in Phase II.
Because they showed the best results in this phase, TIMP-2 and ST-2 were selected to be tested in Phase II, together with NT-proBNP, which had previously shown promising results, even for detecting paroxysmal AF [7] (Figs 4 and S1).

3.2.3. Validation. An automated immunoassay (Ella technology, ProteinSimple) was used to test ST-2 in the validation phase. First, 24 samples from the verification phase were also tested with Ella technology to establish the reproducibility between this technique and the one previously used (Quantikine, R&D Systems). Both techniques showed a strong correlation (r = 0.893, p<0.001). TIMP-2 and NT-proBNP were measured with the same techniques as in the previous phase. None of the 3 proteins tested showed significant differences in Phase II (Table 2). Yet, NT-proBNP and ST-2 tended to have increased levels in AF. None of them showed significantly higher concentrations in Holter-detected AF (Figs 4 and S1).

Devices
From the 359 included individuals, AF was observed in 14 cases in ECG. MyDiagnostick and WatchBP were used in all the patients. AliveCor was used in the 100 individuals included in Phase I (11 with positive ECG). 13.3% of the results were not interpretable by AliveCor and assigned to No AF. Consequently, this device was substituted by Fibricheck in Phase II, which was used in the 259 remaining individuals (3 with positive ECG). The Fibricheck readings resulted in a low signal in 5 cases (1.93% of results) and were also assigned to No AF.
Of the 359 included individuals, one patient was excluded from the comparisons because the devices could not be used due to left-hand amputation. Also, one patient reported pain during arterial pressure measurement due to an arteriovenous fistula, which prevented the Watch BP measurement. Table 3 shows the sensitivity, specificity, PPV, NPV, and AUC for each device in comparison to the conventional ECG. Although some devices were only tested in a subset of patients, WatchBP showed the best sensitivity (84.6%) and AUC (0.895 [0.780-1]) compared to ECG, while MyDiagnostick showed the best specificity (97.10%). Also, it should be taken into account that some patients with positive results in the previous devices had AF detected during the monitoring period (Table 3).

Discussion
In the present study, we aimed to explore several tools to detect AF, with the final purpose of developing an AF screening strategy that could be applied in our community. Specifically, we aimed to identify blood biomarkers associated with AF, and to compare AF detection using various screening devices and long-term monitoring. As key findings, we validated NT-proBNP to identify patients with AF, but failed to find new biomarkers that could improve its performance. Also, of all the devices tested, we found that WatchBP had the best sensitivity compared to ECG as gold standard, while MyDiagnostick had the best specificity.
These results were obtained from a cohort of 359 high-risk patients, of which 34 patients had AF. Interestingly, 24 subjects were diagnosed for the first time during the study period and would potentially benefit from anticoagulant treatment to prevent future strokes. A future follow-up of this cohort of patients will provide more data in this regard. It should be taken into account that AF prevalence in phase I was much higher than in phase II, mainly due to the inclusion of already diagnosed AF patients. These patients were included in Phase I to increase the statistical power of the biomarker discovery study. However, to avoid potential selection bias and confounding effects during the biomarker validation, those patients were excluded from phase II.
The number needed to screen to detect one new AF case was less than 15 in our study, much lower than in single-time-point screening studies [17]. This is not surprising, as diagnostic yield increases with duration, number, and temporal dispersion of the screening [6]. Yet, long monitoring periods are not feasible in screening studies, at least not in every single patient. In our study, patients were mainly diagnosed during the first monitoring days and therefore, we can argue that shorter monitoring periods, for example one week (capturing 80% of the cases) or 2 weeks (capturing almost 90% of the cases), would be a good option in this setting. In fact, in the screening context, it has been difficult to convince patients to wear a Holter device for many days. Although few adverse events were reported during the use of the device, some patients complained about its discomfort, especially in the summer period, and this was one of the main reasons for declining to participate in the study. In the study of Zöga et al [6], when targeting participants with risk factors (older, male, and with higher NT-proBNP) the diagnostic yield of all methodologies increased. Therefore, prioritizing the identification of high-risk individuals to follow for short monitoring periods, even repeated over time, would probably increase patient adherence and satisfaction, and decrease costs and workload, while identifying a high proportion of AF cases.
Notes to Table 3. NPV: Negative predictive value; PPV: Positive predictive value. The first column indicates the number of AF from the total detected by each device (here we count AF cases detected by Holter monitoring). The second column indicates the number of AF from the ones detected by ECG, also detected by each device. Sensitivity, specificity, PPV, NPV and AUC were calculated using the ECG as a reference. https://doi.org/10.1371/journal.pone.0273571.t003
In our study we have already targeted a high-risk population, focusing on hypertensive and diabetic patients of advanced age, conditions that increase the AF prevalence [18]. However, the application of a clinical risk model in this group of patients will optimize even more the selection of candidates to screen. Results have been previously published on the possible relationship between the clinical profile and the risk of AF [19].
As stated before, we confirmed the usefulness of NT-proBNP to detect AF patients in our cohort, but its use was limited in identifying paroxysmal AF cases. Moreover, specificity and positive predictive values were low when aiming for a sensitive cut-off (e.g. 95 pg/ml or 125 pg/ml). NT-proBNP >125 pg/ml was used as a selection tool in the STROKESTOP II study [20], showing higher sensitivity and similar specificity to diagnose AF. However, in their study, as the low-risk group was only screened with an ECG recording, the biomarker sensitivity to detect paroxysmal AF could not be tested. Although NT-proBNP is a good candidate, new biomarkers that could complement and improve its predictive accuracy are needed. With this aim, we performed a comprehensive assessment of blood biomarkers by proteomic arrays. However, we should highlight the limitations and difficulties of "discovery" designs in a complex pathology such as AF, in which the broad categories under which we classify and compare the patients are not clearly defined. First, we cannot rule out the misclassification of subjects in the no AF group. Following the data of Zöga et al [6], 30 days of monitoring would only have identified 34% of patients, and some of our records were even shorter. Second, the natural history of AF and the hypothesis that AF may be just a bystander of an underlying disease process called atrial cardiomyopathy that confers stroke susceptibility should be taken into account [21,22]. Therefore, some individuals without AF may have an "AF substrate" with equal stroke risk and be biologically similar to the AF group. Altogether, these facts add background noise in discovery experiments, especially when performed in a small number of samples. These may be some of the reasons why we obtained poor results in verifying the discovery study. Another reason may be the inclusion of permanent AF cases in that discovery. This may have revealed proteins increased in permanent AF but not in paroxysmal AF, which are the most interesting cases to identify with biomarkers because they are difficult to detect by other methodologies.
Of all the biomarkers of the discovery experiment, ST-2 and TIMP-2 were selected as the most promising. ST-2 is a receptor for interleukin-33 and its soluble form, which is released from the myocardium and vascular endothelial cells in response to pressure or volume overload, had been proposed as a promising biomarker for heart failure [23]. It had also been associated with AF, especially regarding its progression [24]. TIMP-2 is an inhibitor of matrix metalloproteinases and its circulating levels in AF patients in comparison to sinus rhythm are controversial [25,26]. Nevertheless, neither of these biomarkers was validated in Phase II of the study, probably due to the higher proportion of paroxysmal AF cases, which these biomarkers were not able to detect. Of the two biomarkers, ST-2 showed a trend to be elevated in AF patients and seemed an interesting biomarker, but it did not improve the performance of NT-proBNP. Previous discovery experiments have described other biomarkers related to incident or prevalent AF, but their use in screening protocols needs to be further assessed [27][28][29].
Regarding the AF detection devices, in our study we tested the performance of four devices (MyDiagnostick, WatchBP, AliveCor, and Fibricheck) using different technologies: oscillometry, photoplethysmography, and handheld ECG. It should be noted that the comparison may have been affected by the fact that we changed one of the devices from Phase I to Phase II, and, therefore, the sample used for all the devices was not the same. Particularly, the difference in the number of AF cases detected in each phase may have influenced the devices' diagnostic performance measures. For example, the use of Fibricheck only during Phase II, with only 3 patients detected by baseline ECG, biased the calculation of its sensitivity. As a result, the failure to detect one of these three patients gave a sensitivity of 33%, which is not accurate and makes its comparison difficult. Regarding the other devices, despite a relatively small sample size, our study showed similar sensitivity and specificity rates to previous studies when compared to ECG [5]. In fact, the meta-analysis performed by Taggar et al [30] stated that blood pressure monitors like WatchBP showed the highest values of sensitivity to detect AF, while specificity was higher in non-12-lead ECG devices like MyDiagnostick and AliveCor. Nevertheless, the sensitivity rates observed seemed insufficient for routine use in clinical practice, at least for a single-time screening purpose. However, most of the tested devices were not intended for single-time use; their best performance may be obtained after several sequential recordings at different time points, even by the patients themselves after symptoms appear. According to our experience, WatchBP, with the highest sensitivity, is one of the most advantageous devices in a clinical setting for population screening strategies.
Also, NICE (National Institute for Health and Care Excellence) advocates the use of WatchBP for the detection of AF in patients already being monitored for hypertension, taking advantage of the dual functionality of the device [31]. On the other hand, the benefits of MyDiagnostick, AliveCor, and Fibricheck might be higher for patients' self-monitoring. As ECG confirmation is mandatory by guidelines for the diagnosis of AF, handheld ECG devices that provide a verifiable ECG trace, like MyDiagnostick and AliveCor, present a clear advantage [4].
Although the results obtained in the present project are not accurate enough to implement a screening program, the collaboration of a multidisciplinary team integrating primary care physicians, cardiologists, and researchers provided useful lessons to take into account in primary care AF management and future study designs. Taking into account all our results, the use of a clinical predictive model in combination with screening devices (e.g. WatchBP or MyDiagnostick) and biomarkers (e.g. NT-proBNP) would probably be useful to select high-risk patients to be monitored for AF detection. Then, selected patients could be monitored continuously during short periods (e.g. 7 days) with a wearable Holter device. In our experience, primary care physicians could be in charge of the monitoring, with the collaboration of cardiologists to interpret the records. Another option would be patient self-monitoring using devices or mobile applications (e.g. Fibricheck, AliveCor, MyDiagnostick). The cost-effectiveness of such a screening design needs to be further evaluated.
Apart from the limitations discussed so far, the main limitation of our study was the reduced sample size. Although the final sample size was inside the sample calculation window, we aimed for a larger sample size in Phase II, corresponding to the upper limit of the sample size calculation (n = 282). However, the onset of the Covid-19 pandemic and the end of the funding period for the study limited the inclusion. More importantly, even though a large number of patients were included, the prospective nature of the study limited the number of AF cases detected. The limited number of AF patients may prevent generalizability of our results, which should be further confirmed. The sample size limitation was particularly important in analyses performed in a subset of patients, like the discovery biomarker experiment. Also, due to some technical problems with the wearable Holter devices, some patients had to be excluded from some analyses, reducing the statistical power even more. Finally, the participant selection was not randomized and might have suffered selection bias (e.g. healthy user bias, or patients with more cardiac pathologies being more prone to participate). However, this cohort represents a population that would participate in an AF screening study.

Conclusions
The inclusion and monitoring of a cohort of primary care patients for AF detection, together with the testing of biomarkers and screening devices provided useful lessons about AF screening in our community, despite the limited results obtained. An AF screening strategy using rhythm detection devices and short monitoring periods among high-risk patients with high NT-proBNP levels could be feasible.
Moreover, the present results will contribute to the AFFECT-EU initiative in which information from different European screening studies will permit estimating and drawing conclusions on the efficacy of different screening methods and strategies [32].

S1 Fig. Boxplot distribution of the 3 proteins selected for validation (NT-proBNP, ST-2 and TIMP-2) according to the methodology used for AF diagnosis. (TIF)
S1 Table. Top table of the differentially expressed proteins between AF and no AF. Results from the discovery study. Proteins selected to be verified in the whole Phase 1 are highlighted in grey. Proteins in bold but not highlighted were already tested in the whole phase 1 as part of a previous published work [7]. (DOCX)
S1 Dataset. All patient data and biomarker measurements.